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WHAT IS CLAIMED IS: 



1 1 . A method of identifying a zinc finger domain that recognizes a target site on a 

2 DNA, the method comprising: 

3 (a) providing cells containing a reporter construct, the construct 

4 comprising a reporter gene operably linked to a promoter, wherein the reporter gene is 

5 expressed above a given level when a transcription factor recognizes both a recruitment site 

6 and a target site of the promoter, but not when the transcription factor recognizes only the 

7 recruitment site of the promoter; 

8 (b) providing a plurality of hybrid nucleic acids, each of which encodes a 

9 non-naturally occurring protein comprising (i) a transcription activation domain, (ii) a DNA 

10 binding domain that recognizes the recruitment site, and (iii) a test zinc finger domain, 

1 1 wherein the encoded amino acid sequence of the test zinc finger domain varies among the 

j>n hi 

12 members of the plurality; 

y} 13 (c) contactingthepluralityofhybrid nucleic acids with the cells under 

i-> ^ ii 

III 14 conditions that permit at least one of the plurality of nucleic acids to enter at least one of the 

^11 15 cells; 

■< 

f II 1 6 (d) maintaining the cells under conditions permitting expression of the 

f 17 hybrid nucleic acids in the cells; and 

18 (e) identifying a cell that contains a hybrid nucleic acid of (b) and that 

m 19 expresses the reporter gene above the given level as an indication that the cell contains a 

2 20 hybrid nucleic acid encoding a test zinc finger domain that recognizes the target site. 

1 2. The method of claim 1, wherein the cells are eukaryotic cells. 

1 3. The method of claim 2, wherein the cells are yeast cells. 

1 4. The method of claim 3, wherein the cells are Saccharomyces cerevisiae cells. 

1 5. The method of claim 1, wherein the reporter gene is a selectable marker. 
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1 6. The method of claim 5, wherein the selectable marker is selected from the group 

2 consisting of URA3, HISS, LEU2, ADEl and TRPL 

1 7. The method of claim 1, wherein the reporter gene is selected from the group 

2 consisting of lacZ, CAT, luciferase, GUS, and GFP. 

1 8. The method of claim 1, wherein the DNA binding domain comprises a zinc finger 

2 domain. 

1 9. The method of claim 8, wherein the DNA binding domain comprises two zinc 

2 finger domains. 

1 10. The method of claim 9, wherein the DNA binding domain comprises three zinc 

^rt 2 finger domains. 

L ;: 
*^ '*. 

n\ 

Tt ■< 
w ^ IP 

[f! 1 11. The method of claim 1 , further comprising the steps of (i) amplifying a source 

\ 2 nucleic acid encoding the test zinc finger domain from genomic nucleic acid, a messenger 

fll 3 RNA (mRNA) mixture, or a complementary DNA (cDNA) mixture, using an oligonucleotide 

4 primer that anneals to a sequence encoding a conserved domain boundary to produce an 

I ^ 5 amplified fragment; and (ii) utilizing the amplified fragment to construct a hybrid nucleic 

[f| 6 acid for inclusion in the plurality of hybrid nucleic acids of step (b). 

1 12. The method of claim 1 , further comprising the steps of (i) identifying a candidate 

2 zinc finger domain amino acid sequence in a sequence database; (ii) providing a candidate 

3 nucleic acid encoding the candidate zinc finger domain amino acid sequence, and (iii) 

4 utilizing the candidate nucleic acid to construct a hybrid nucleic acid for inclusion in the 

5 plurality of hybrid nucleic acids of step (b). 

1 13. The method of claim 5, wherein the selectable marker is an auxotrophy gene 

2 required for the synthesis of a metabolite; the genome of the cells lacks a functional copy of 

3 the auxotrophy gene; and, during step (d), the cells are maintained in a medium prepared 

4 without the metabolite. 
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1 14. The method of claim 1, wherein steps (a) to (f) are repeated to identify a second 

2 test zinc finger domain that recognizes a second target site. 

1 15. The method of claim 14, fiirther comprising constructing a nucleic acid encoding 

2 a polypeptide comprising the first test zinc finger domain and the second test zinc finger 

3 domain. 

1 16. A method of identifying a zinc finger domain that recognizes a target site on a 

2 DNA, the method comprising: 

3 (a) providing cells containing a reporter construct, the construct comprising a 

4 reporter gene operably linked to a promoter, wherein the reporter gene is expressed above a 

5 given level when a transcription factor recognizes both a recruitment site and a target site of 
e the promoter, but not when the transcription factor recognizes only the recruitment site of the 

'■J 7 promoter; 

m 8 (b) amplifying a plurality of nucleic acid sequences, each of which encodes a test 

>H ^ " 

!=■ J 9 zinc finger domain, using an oligonucleotide primer that aimeals to a nucleic acid encoding a 

fll 10 conserved domain boundary; 

%^ 1 1 (c) joining each nucleic acid sequence of (b) to nucleic acid sequences encoding 

Kl 12 (i) a transcription activation domain, and (ii) a DNA binding domain that recognizes the 

m 13 recruitment site, to form a plurality of hybrid nucleic acids; 

f*i ■ 

Jf 14 (d) contacting the plurality of hybrid nucleic acids of (c) with the cells of (a) 

15 under conditions that permit at least one of the plurality of hybrid nucleic acids to enter at 

1 6 least one of the cells; 

17 (e) maintaining the cells under conditions permitting expression of the hybrid 

18 nucleic acids in the cells; and 

19 (f) identifying a cell that contains a hybrid nucleic acid of (c) and that expresses 

20 the reporter gene above the given level, wherein the hybrid nucleic acid encodes a zinc finger 

21 domain that recognizes the target site on a DNA. 

1 17. The method of claim 1 6, wherein the cells are yeast cells. 
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1 18. The method of claim 1 6, wherein the reporter gene is selected from the group 

2 consisting of lacZ, CAT, luciferase, GUS, and GFP. 

1 19. The method of claim 16, wherein the DNA binding domain comprises a zinc 

2 finger domain. 

1 20. The method of claim 1 9, wherein the DNA binding domain comprises two zinc 

2 finger domains. 

1 2 1 . A method of determining whether a test zinc finger domain recognizes a target 

2 site on a promoter, the method comprising: 

3 (a) providing a reporter construct comprising a reporter gene operably 
■"i 4 linked to a promoter, wherein the reporter gene is expressed above a given level when a 
H 6 transcription factor recognizes both a recruitment site and a target site of the promoter, but 
m 6 not when the transcription factor recognizes only the recruitment site of the promoter; 

7 (b) providing a hybrid nucleic acid that encodes a non-naturally occurring 

s: 8 protein comprising (i) a transcription activation domain, (ii) a DNA binding domain that 

St 9 recognizes the recruitment site, and (iii) a test zinc finger domain; 

CP 10 (c) contacting the reporter construct with a cell under conditions that 

ii X 1 1 permit the reporter construct to enter the cell; 

12 (d) prior to, after, or concurrent with step (c), contacting the hybrid 

13 nucleic acid with the cell under conditions that permit the hybrid nucleic acid to enter the 

14 cell; 

15 (e) maintaining the cell under conditions permitting expression of the 

16 hybrid nucleic acid in the cell; and 

1 7 (f) detecting reporter gene expression in the cell, wherein a level of 

1 8 reporter gene expression greater than the given level is an indication that the test zinc finger 

1 9 domain recognizes the target site. 
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1 22. The method of claim 21 , further comprising the step of amphfying a nucleic acid 

2 encoding the test zinc finger domain from genomic DNA, an mRNA mixture or a cDNA 

3 mixture using an oligonucleotide primer that anneals to a sequence encoding a conserved 

4 domain boundary. 

5 23 . The method of claim 2 1 , further comprising the steps of (i) identifying a 

6 candidate zinc finger domain amino acid sequence in a sequence database; (ii) providing a 

7 candidate nucleic acid encoding the candidate zinc finger domain amino acid sequence, and 

8 (iii) utilizing the candidate nucleic acid to construct a hybrid nucleic acid for inclusion in the 

9 plurality of hybrid nucleic acids of step (b). 

I 24. A method of determining v^hether a test zinc finger domain recognizes a target 
% 2 site on a promoter, the method comprising: 

H 3 (a) providing a first cell comprismg a reporter construct comprising a 

iii, ii„i; 

[ri 4 reporter gene operably linked to a promoter, wherein the reporter gene is expressed above a 

r \ 5 given level when a transcription factor recognizes both a recruitment site and a target site of 

rlJ 6 the promoter, but not when the transcription factor recognizes only the recruitment site of the 

»^ 

fi 7 promoter; 

8 (b) providing a second cell comprising a hybrid nucleic acid that encodes 

5:: 9 a protein comprising (i) a transcription activation domain, (ii) a DNA binding domain that 

h^^ 10 recognizes the recruitment binding site, and (iii) a test zinc finger domain; 

I I (c) fusing the first and second cells to form a fused cell; 

1 2 (d) maintaining the fused cell under conditions permitting expression of 

13 the hybrid nucleic acids in the cell; and 

14 (e) detecting reporter gene expression in the fused cell, wherein a level of 

15 reporter gene expression greater than the given level is an indication that the test zinc finger 

1 6 domain recognizes the target site. 

1 25. The method of claim 24 wherein the first and second cells are yeast cells of the 

2 opposite mating types. 
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26. A method of determining whether a test zinc finger domain recognizes a target 
site on a promoter, the method comprising: 

(a) providing a plurality of reporter constructs, each construct comprising 
a reporter gene operably linked to a promoter, wherein the reporter gene is expressed above a 
given level when a transcription factor recognizes both a recruitment site and a target site of 
the promoter, but not when the transcription factor recognizes only the recruitment site of the 
promoter; 

(b) providing a cell containing a hybrid nucleic acid, that encodes a non- 
naturally occurring protein comprising (i) a transcription activation domain, (ii) a DNA 
binding domain that recognizes the recruitment site, and (iii) a test zinc finger domain; 

(c) contacting the plurality of reporter constructs with the cell under 
conditions that permit at least one of the plurality of reporter constructs to enter the cell; 

(d) maintaming the cell under conditions permitting expression of the 
hybrid nucleic acid in the cell; and 

(e) identifying a cell that contains a reporter gene of (a) and that expresses 
the reporter gene above the given level as an indication that the reporter construct in the cell 
comprises a target site recognized by the test zinc finger domain. 

27. The method of claim 26, wherein the target binding site is between two and six 
nucleotides long. 

28. The method of claim 27, wherein the plurality of reporter constructs comprises 
every possible combination of A, T, G, and C nucleotides at at least two positions of the 
target binding site. 

29. The method of claim 28, wherein the plurality of reporter constructs comprises 
every possible combination of A, T, G, and C nucleotides at at least three positions of the 
target binding sites. 
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30. The method of claim 26, wherein steps (a) to (e) are repeated for a second test 
zinc finger domain to identify a second binding preference. 

3 1 . The method of claim 30, further comprising constructing a nucleic acid encoding 
a polypeptide comprising the first second test zinc finger domains. 

32. A method of identifying a plurality of zinc finger domains, the method 
comprising: 

carrying out the method of claim 1 to identify a first test zinc finger domain; and 

carrying out the method of claim 1 again to identify a second test zinc finger domain 
that recognizes a target site different from the target site recognized by the first test zinc 
finger domain. 

33. A method of generating a nucleic acid encoding a chimeric zinc finger protein, 
the method comprising: 

carrying out the method of claim 32; 

constructing a nucleic acid encoding a polypeptide comprising the first and 
second test zinc finger domains. 

34. A method of identifying DNA sequences recognized by zinc finger domains, the 
method comprising: 

carrying out the method of claim 24 to identify a first target site recognized by a first 
test zinc finger domain; and 

carrying out the method of claim 24 again to identify a second target site recognized 
by a second test zinc finger domain. 

35. A method of generating a nucleic acid encoding a chimeric zinc finger protein, 
the method comprising: 
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3 carrying out the method of claim 34; 

4 constructing a nucleic acid encoding a polypeptide comprising the first and 

5 second test zinc finger domains. 

1 36. A purified polypeptide comprising the amino acid sequence: 

2 Xa-X-Cys-X2-5-Cys-X3-Xa-X-Cys-X-Ser-Asn-Xb-X-Arg-His-X3-5-His 

3 (SEQIDNO:68X 

4 wherein Xa is phenylalanine or tyrosine, and Xb is a hydrophobic residue. 
1 37. A nucleic acid comprising a sequence encoding the polypeptide of claim 36. 



38. A purified polypeptide comprising the amino acid sequence: 
Xa-X-Cys-X2.5-Cys-X3-Xa-X-His-X-Ser-Asn-Xb-X-Lys-His-X3.5-His 

(SEQ ID NO:69), 

wherein Xa is phenylalanine or tyrosine, and Xb is a hydrophobic residue. 

39. A nucleic acid comprising a sequence encoding the polypeptide of claim 38. 

40. A purified polypeptide comprising the amino acid sequence: 
Xa-X-Cys-X2-5-Cys-X3-Xa-X-Ser-X-Ser"Asn-Xb-X-Arg-His-X3-5-His 

(SEQ IDNO:70), 

wherein Xa is phenylalanine or tyrosine, and Xb is a hydrophobic residue. 



1 41 . A nucleic acid comprising a sequence encoding the polypeptide of claim 40. 

1 42. A purified polypeptide comprising the amino acid sequence: 

2 Xa-X-Cys-X2-5-Cys-X3-Xa-X-Gln-X-Ser-Thr-Xb-X-Val-His-X3.5-His 

3 (SEQIDN0:71), 

4 wherein Xa is phenylalanine or tyrosine, and Xb is a hydrophobic residue. 

1 43. A nucleic acid comprising a sequence encoding the polypeptide of claim 42. 

1 44. A purified polypeptide comprising the amino acid sequence: 
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Xa-X-Cys-X2-5-Cys-X3-Xa-X-Val-X.Ser-Xc-Xb-X-Arg-His-X3.5-H^ (SEQ ID NO:72), 
wherein Xa is phenylalanine or tyrosine, Xb is a hydrophobic residue, and Xc 
is serine or threonine. 

45. A nucleic acid comprising a sequence encoding the polypeptide of claim 44. 

46. A purified polypeptide comprising the amino acid sequence: 

Xa-X-Cys-X2.5-Cys-X3-Xa-X-Gln-X-Ser-His-Xb-X-Arg-His-X3.5-His 
(SEQ ID NO:73), 

wherein Xa is phenylalanine or tyrosine, and Xb is a hydrophobic residue. 

47. A nucleic acid comprising a sequence encoding the polypeptide of claim 46. 

48. A purified polypeptide comprising the amino acid sequence: 

Xa-X-Cys-X2.5-Cys-X3-Xa-X-Gln-X-Ser-Asn-Xb-X-Val-His-X3.5-His 
(SEQ IDNO:74), 

wherein Xa is phenylalanine or tyrosine, and Xb is a hydrophobic residue. 

49. A nucleic acid comprising a sequence encoding the polypeptide of claim 48. 

50. A purified polypeptide comprising the amino acid sequence: 
Xa-X-Cys-X2.5-Cys-X3-Xa-X-Gln-X-Ser-Xc-Xb-X-Arg-His-X3-5-His 

(SEQ ID NO:75), 

wherein Xa is phenylalanine or tyrosine, Xb is a hydrophobic residue, and Xc 
is serine or threonine. 

5 1 . A nucleic acid comprising a sequence encoding the polypeptide of claim 50. 

52. A purified polypeptide comprising an amino acid sequence 60% identical to SEQ 
IDNO:65. 

53. A nucleic acid comprising a sequence encoding the polypeptide of claim 52. 
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54. A purified polypeptide, comprising an amino acid sequence 60% identical to an 
amino acid sequence selected from the group consisting of: SEQ ID NO:29, 127, 129, 131, 
133, and 135. 

55. A nucleic acid, comprising a sequence encoding the polypeptide of claim 54. 

56. A purified polypeptide comprising the amino acid sequence: 
Xa-X-Cys-X2.5-Cys-X3-Xa-X-Gln-X-Ala-His-Xb-X-Arg-His-X3.5-His 

(SEQ IDNO:150), 

wherein Xa is phenylalanine or tyrosine, and Xb is a hydrophobic residue. 

57. A nucleic acid comprising a sequence encoding the polypeptide of claim 56. 

58. A purified polypeptide comprising the amino acid sequence: 
Xa-X-Cys-X2.5"Cys-X3-Xa-X-Gln-X-Phe-Asn-Xb-X-Arg-His-X3-5-His 

(SEQ ID NO: 151), 

wherein Xa is phenylalanine or tyrosine, and Xb is a hydrophobic residue. 

59. A nucleic acid comprising a sequence encoding the polypeptide of claim 58. 

60. A purified polypeptide comprising the amino acid sequence: 
Xa-X-Cys-X2-5-Cys-X3-Xa-X-Gln-X-Ser-His-Xb-X-Thr-His-X3-5-His 

(SEQ ID NO: 152), 

wherein Xa is phenylalanine or tyrosine, and Xb is a hydrophobic residue. 

61 . A nucleic acid comprising a sequence encoding the polypeptide of claim 60. 

62. A pxmfied polypeptide comprising the amino acid sequence: 
Xa-X-Cys-X2.5-Cys-X3-Xa-X-Gln-X-Ser-His-Xb-X-Val-His-X3-5-His 

(SEQ ID NO: 153), 

wherein Xa is phenylalanine or tyrosine, and Xb is a hydrophobic residue. 
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63. A nucleic acid comrising a sequence encoding the polypeptide of claim 62. 

64. A purified polypeptide comprising the amino acid sequence: 
Xa-X-Cys-X2.5-Cys-X3-Xa-X-Gln-X-Ser-Asn-Xb-X-Ile-His-X3-5-His 

(SEQ ID NO: 154), 

wherein Xa is phenylalanine or tyrosine, and Xb is a hydrophobic residue. 

65. A nucleic acid comprising a sequence encoding the polypeptide of claim 64. 

66. A purified polypeptide comprising the amino acid sequence: 

Xa-X-Cys-X2-5-Cys-X3"Xa-X-Gln-X-Ser-Asn-Xb-X-Arg-His-X3.5-His 

(SEQ IDNO:155X 

wherein Xa is phenylalanine or tyrosine, and Xb is a hydrophobic residue. 



67. A nucleic acid comprising a sequence encoding the polypeptide of claim 66. 

68. A purified polypeptide comprising the amino acid sequence: 
Xa-X-Cys-X2.5-Cys-X3-Xa-X-Gln-X-Thr-His-Xb-X-Gln-His-X3-5-His 

(SEQ ID NO: 156), 

wherein Xa is phenylalanine or tyrosine, and Xb is a hydrophobic residue. 

69. A nucleic acid comprising a sequence encoding the polypeptide of claim 68. 

70. A purified polypeptide comprising the amino acid sequence: 

Cys-X2-5-Cys-X3-Xa-X-Gln-X-Thr-His-Xb-X-Arg-His-X3.5-His 

(SEQ ID NO: 157), 

wherein Xa is phenylalanine or tyrosine, and Xb is a hydrophobic residue. 

71. A nucleic acid comprising a sequence encoding the polypeptide of claim 70. 
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1 72. A purified polypeptide comprising the amino acid sequence: 

2 Xa-X-Cys-X2-5-Cys-X3-Xa-X-Arg-X-Asp-Lys-Xb-X-Ile-ffis-X3-5-His 

3 (SEQIDNO:158), 

4 wherein Xa is phenylalanine or tyrosine, and Xb is a hydrophobic residue. 
1 73. A nucleic acid comprising a sequence encoding the polypeptide of claim 72. 

1 74. A purified polypeptide comprising the amino acid sequence: 

2 Xa-X-Cys-X2-5-Cys-X3-Xa-X-Arg-X-Ser-Asn-Xb-X-Arg-His-X3-5-His 

3 (SEQIDNO:159), 

4 wherein Xa is phenylalanine or tyrosine, and Xb is a hydrophobic residue. 
1 75 . A nucleic acid comprising a sequence encoding the polypeptide of claim 74. 

Ml 'H 

^0 1 76. A purified polypeptide comprising an amino acid sequence 60% identical to SEQ 

ra 2 ID NO: 141. 

i# i< -1 

J 1 77. A nucleic acid comprising a sequence encoding the polypeptide of claim 76. 

1 78. A purified polypeptide comprising an amino acid sequence 60% identical to SEQ 

fy 2 ID NO: 107. 

^^^^ 

HI 

1 79. A nucleic acid comprising a sequence encoding the polypeptide of claim 78. 

1 80. A purified polypeptide comprising an amino acid sequence 60% identical to SEQ 

2 IDNO:137. 
3 

1 8 1 . A nucleic acid comprising a sequence encoding the polypeptide of claim 80. 

1 82. A purified polypeptide comprising an amino acid sequence 60% identical to SEQ 

2 IDNO:145. 

1 83. A nucleic acid comprising a sequence encoding the polypeptide of claim 82. 
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84. A purified polypeptide comprising an amino acid sequence 60% identical to SEQ 
ID NO: 149. 

85. A nucleic acid comprising a sequence encoding the polypeptide of claim 84. 
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